Rehearsal Audio Stream Segmentation and Clustering
نویسندگان
چکیده
Rehearsal recordings are valuable for music ensembles to improve their performance. However, since rehearsal recordings are typically hours long and contain disordered, disrupted, and unclassified musical content (mixed with different sections of different music pieces, conductor’s talk, all kinds of noise between rehearsal intervals and so on), they’re hard to use directly. For example, recordings for a particular song are not easily found. This paper presents a variety of approaches to extract all the musical segments from the disarrayed rehearsal audio stream and then cluster the segments belonging to the same piece of music together. The procedures discussed herein have the potential to be applied in a large scale music database to accomplish the music information retrieval (MIR) tasks.
منابع مشابه
Segmentation, Clustering, and Display in a Personal Audio Database for Musicians
Managing music audio databases for practicing musicians presents new and interesting challenges. We describe a systematic investigation to provide useful capabilities to musicians both in rehearsal and when practicing alone. Our goal is to allow musicians to automatically record, organize, and retrieve rehearsal (and other) audio to facilitate review and practice (for example, playing along wit...
متن کاملSegment Generation and Clustering in the HTK Broadcast News Transcription System
This paper describes the segmentation, gender detection and segment clustering scheme used in the 1997 HTK broadcast news evaluation system and presents results on both the unpartitioned 1996 development and the 1997 evaluation sets. The stages of our approach are presented, namely classification, segmentation and gender detection, gender relabelling, and clustering of speech segments. The eval...
متن کاملUnsupervised audio stream segmentation and clustering via the Bayesian information criterion
In this paper, we propose an eÆcient approach for unsupervised audio stream segmentation and clustering via the Bayesian Information Criterion (BIC). The proposed method extends an earlier formulation by Chen and Gopalakrishnan[1]. In our segmentation formulation, Hotelling's T -Statistic is used to pre-select candidate segmentation boundaries followed by BIC to make the segmentation decision. ...
متن کاملRobust Unsupervised Speaker Segmentation for Audio Diarization
Audio diarization Reynolds & Carrasquillo (2005) is the process of partitioning an input audio stream into homogeneous regions according to their specific audio sources. These sources can include audio type (speech, music, background noise, ect.), speaker identity and channel characteristics. With the continually increasing number of larges volumes of spoken documents including broadcasts, voic...
متن کاملSegmentation, Classification and Clustering of an Italian Broadcast News Corpus
This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, classification and clustering of an Italian audio broadcast news corpus. The approach is based on the following stages. First, the input data stream is segmented by detecting spectral changes through the Bayesian Information Criterion (BIC). Second, segments are classified in terms of acoustic conditi...
متن کامل